Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 16 |
| Number of observations | 150451 |
| Missing cells | 301974 |
| Missing cells (%) | 12.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 18.4 MiB |
| Average record size in memory | 128.0 B |
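The overview figures can be cross-checked against each other. The sketch below recomputes the missing-cell total and percentage from the per-column missing counts that the report lists in its warnings, and the memory figures from the row and column counts. The 8-bytes-per-cell figure is an assumption (a 64-bit number or object pointer per cell); the report does not state it.

```python
# Figures copied from the report.
n_rows = 150_451
n_cols = 16
missing_by_column = {
    "Striker_Batting_Position": 13_861,
    "Player_Out": 143_013,
    "Fielders": 145_100,
}

# Missing cells are the sum of the per-column missing counts.
missing_cells = sum(missing_by_column.values())
missing_pct = 100 * missing_cells / (n_rows * n_cols)

# Assumption (not stated in the report): every cell occupies 8 bytes.
bytes_per_record = n_cols * 8
total_mib = n_rows * bytes_per_record / 2**20

print(missing_cells, round(missing_pct, 1), bytes_per_record, round(total_mib, 1))
```

Under that assumption the recomputed values agree with the table: 301974 missing cells, 12.5%, 128 B per record, and 18.4 MiB in total.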
Variable types
| Type | Count |
|---|---|
| NUM | 10 |
| CAT | 6 |
Warnings
| Warning | Type |
|---|---|
| Match_Date has a high cardinality: 450 distinct values | High cardinality |
| Player_Out is highly correlated with Striker | High correlation |
| Striker is highly correlated with Player_Out | High correlation |
| Striker_Batting_Position has 13861 (9.2%) missing values | Missing |
| Player_Out has 143013 (95.1%) missing values | Missing |
| Fielders has 145100 (96.4%) missing values | Missing |
| Batsman_Runs_Scored has 61151 (40.6%) zeros | Zeros |
Reproduction
| Item | Value |
|---|---|
| Analysis started | 2021-11-09 10:42:53.332133 |
| Analysis finished | 2021-11-09 10:43:17.694374 |
| Duration | 24.36 seconds |
| Software version | pandas-profiling v2.9.0 |
MatcH_id
Real number (ℝ≥0)
| Distinct | 636 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 636207.5251 |
|---|---|
| Minimum | 335987 |
| Maximum | 1082650 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 335987 |
|---|---|
| 5-th percentile | 336019 |
| Q1 | 419154 |
| median | 548382 |
| Q3 | 829742 |
| 95-th percentile | 1082617 |
| Maximum | 1082650 |
| Range | 746663 |
| Interquartile range (IQR) | 410588 |
Descriptive statistics
| Standard deviation | 234362.2892 |
|---|---|
| Coefficient of variation (CV) | 0.368373966 |
| Kurtosis | -0.8701765653 |
| Mean | 636207.5251 |
| Median Absolute Deviation (MAD) | 156170 |
| Skewness | 0.5973540118 |
| Sum | 9.571805835e+10 |
| Variance | 5.49256826e+10 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 392195 | 267 | 0.2% | |
| 1082625 | 263 | 0.2% | |
| 729320 | 262 | 0.2% | |
| 829742 | 261 | 0.2% | |
| 598009 | 261 | 0.2% | |
| 829816 | 259 | 0.2% | |
| 419126 | 259 | 0.2% | |
| 829746 | 258 | 0.2% | |
| 598022 | 258 | 0.2% | |
| 419147 | 257 | 0.2% | |
| Other values (626) | 147846 | 98.3% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 335987 | 225 | 0.1% | |
| 335988 | 248 | 0.2% | |
| 335989 | 219 | 0.1% | |
| 335990 | 246 | 0.2% | |
| 335991 | 240 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1082650 | 248 | 0.2% | |
| 1082649 | 207 | 0.1% | |
| 1082648 | 157 | 0.1% | |
| 1082647 | 252 | 0.2% | |
| 1082646 | 249 | 0.2% |
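Several of the statistics reported for MatcH_id are derived from others: the range is maximum minus minimum, the IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. The following is a minimal sketch of those relationships using only the Python standard library. The function name `summarize` is hypothetical, and the choice of the sample standard deviation and the "inclusive" quantile method is an assumption about how the report was computed.

```python
import math
import statistics


def summarize(values):
    """Return range, IQR, CV, variance and MAD for a list of numbers."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    mean = statistics.fmean(values)
    std = statistics.stdev(values)  # sample standard deviation (assumed)
    return {
        "range": max(values) - min(values),
        "iqr": q3 - q1,
        "cv": std / mean,
        "variance": std ** 2,
        # Median absolute deviation: median distance from the median.
        "mad": statistics.median([abs(v - median) for v in values]),
    }


# Consistency check against the figures reported for MatcH_id.
minimum, q1, q3, maximum = 335987, 419154, 829742, 1082650
mean, std = 636207.5251, 234362.2892
report_range = maximum - minimum   # reported: 746663
report_iqr = q3 - q1               # reported: 410588
report_cv = std / mean             # reported: 0.368373966
report_variance = std ** 2         # reported: 5.49256826e+10
```

Calling `summarize([1, 2, 3, 4, 5])` gives a range of 4, an IQR of 2 and a MAD of 1, which is a quick way to sanity-check the definitions on a small input.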
Over_id
Real number (ℝ≥0)
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.14270427 |
|---|---|
| Minimum | 1 |
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5 |
| median | 10 |
| Q3 | 15 |
| 95-th percentile | 19 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 5.674254966 |
|---|---|
| Coefficient of variation (CV) | 0.5594420202 |
| Kurtosis | -1.18115921 |
| Mean | 10.14270427 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.05347944711 |
| Sum | 1525980 |
| Variance | 32.19716942 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 8092 | 5.4% | |
| 2 | 8017 | 5.3% | |
| 3 | 7931 | 5.3% | |
| 4 | 7901 | 5.3% | |
| 5 | 7873 | 5.2% | |
| 6 | 7864 | 5.2% | |
| 7 | 7826 | 5.2% | |
| 8 | 7799 | 5.2% | |
| 9 | 7775 | 5.2% | |
| 10 | 7726 | 5.1% | |
| Other values (10) | 71647 | 47.6% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 8092 | 5.4% | |
| 2 | 8017 | 5.3% | |
| 3 | 7931 | 5.3% | |
| 4 | 7901 | 5.3% | |
| 5 | 7873 | 5.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 20 | 5648 | 3.8% | |
| 19 | 6542 | 4.3% | |
| 18 | 6979 | 4.6% | |
| 17 | 7233 | 4.8% | |
| 16 | 7332 | 4.9% |
Ball_id
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.616639304 |
|---|---|
| Minimum | 1 |
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.807638431 |
|---|---|
| Coefficient of variation (CV) | 0.4998116425 |
| Kurtosis | -1.081746879 |
| Mean | 3.616639304 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.09679432471 |
| Sum | 544127 |
| Variance | 3.267556698 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 24388 | 16.2% | |
| 2 | 24330 | 16.2% | |
| 3 | 24261 | 16.1% | |
| 4 | 24202 | 16.1% | |
| 5 | 24123 | 16.0% | |
| 6 | 24041 | 16.0% | |
| 7 | 4324 | 2.9% | |
| 8 | 679 | 0.5% | |
| 9 | 103 | 0.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 24388 | 16.2% | |
| 2 | 24330 | 16.2% | |
| 3 | 24261 | 16.1% | |
| 4 | 24202 | 16.1% | |
| 5 | 24123 | 16.0% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 9 | 103 | 0.1% | |
| 8 | 679 | 0.5% | |
| 7 | 4324 | 2.9% | |
| 6 | 24041 | 16.0% | |
| 5 | 24123 | 16.0% |
Innings_No
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 78024 | 51.9% |
| 2 | 72346 | 48.1% |
| 3 | 43 | < 0.1% |
| 4 | 38 | < 0.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Team_Batting
Categorical
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 7 | 16988 | 11.3% |
| 2 | 16140 | 10.7% |
| 4 | 15991 | 10.6% |
| 3 | 15821 | 10.5% |
| 6 | 15481 | 10.3% |
| 1 | 15416 | 10.2% |
| 5 | 13845 | 9.2% |
| 8 | 9033 | 6.0% |
| 11 | 7379 | 4.9% |
| 10 | 5443 | 3.6% |
| Other values (11) | 18914 | 12.6% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 27 |
|---|---|
| Median length | 1 |
| Mean length | 2.711779915 |
| Min length | 1 |
Team_Bowling
Categorical
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 7 | 16704 | 11.1% |
| 2 | 16416 | 10.9% |
| 4 | 15776 | 10.5% |
| 1 | 15585 | 10.4% |
| 6 | 15534 | 10.3% |
| 3 | 15493 | 10.3% |
| 5 | 14177 | 9.4% |
| 8 | 9038 | 6.0% |
| 11 | 7276 | 4.8% |
| 10 | 5457 | 3.6% |
| Other values (11) | 18995 | 12.6% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 27 |
|---|---|
| Median length | 1 |
| Mean length | 2.713069371 |
| Min length | 1 |
Striker_Batting_Position
Real number (ℝ≥0)
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 13861 |
| Missing (%) | 9.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.583637162 |
|---|---|
| Minimum | 1 |
| Maximum | 11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 8 |
| Maximum | 11 |
| Range | 10 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.145089808 |
|---|---|
| Coefficient of variation (CV) | 0.5985789605 |
| Kurtosis | 0.1815855545 |
| Mean | 3.583637162 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7946178026 |
| Sum | 489489 |
| Variance | 4.601410282 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 25584 | 17.0% | |
| 1 | 25476 | 16.9% | |
| 3 | 23439 | 15.6% | |
| 4 | 20435 | 13.6% | |
| 5 | 16627 | 11.1% | |
| 6 | 10844 | 7.2% | |
| 7 | 6633 | 4.4% | |
| 8 | 3834 | 2.5% | |
| 9 | 2126 | 1.4% | |
| 10 | 1160 | 0.8% | |
| (Missing) | 13861 | 9.2% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 25476 | 16.9% | |
| 2 | 25584 | 17.0% | |
| 3 | 23439 | 15.6% | |
| 4 | 20435 | 13.6% | |
| 5 | 16627 | 11.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 11 | 432 | 0.3% | |
| 10 | 1160 | 0.8% | |
| 9 | 2126 | 1.4% | |
| 8 | 3834 | 2.5% | |
| 7 | 6633 | 4.4% |
Extra_Type
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| No Extras | 142255 | 94.6% |
| wides | 4153 | 2.8% |
| legbyes | 2357 | 1.6% |
| noballs | 579 | 0.4% |
| Wides | 422 | 0.3% |
| byes | 379 | 0.3% |
| Legbyes | 233 | 0.2% |
| Noballs | 39 | < 0.1% |
| Byes | 33 | < 0.1% |
| penalty | 1 | < 0.1% |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.822015141 |
| Min length | 4 |
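The Extra_Type value counts contain the same label in two capitalisations ("wides" and "Wides", "byes" and "Byes", and so on), which inflates the distinct count to 10. A minimal sketch of how the counts collapse once labels are lower-cased is shown below. The counts are copied from the report; lower-casing as the normalisation rule is an assumption about what the data owner intends.

```python
from collections import Counter

# Value counts for Extra_Type, copied from the report.
extra_counts = {
    "No Extras": 142255, "wides": 4153, "legbyes": 2357, "noballs": 579,
    "Wides": 422, "byes": 379, "Legbyes": 233, "Noballs": 39, "Byes": 33,
    "penalty": 1,
}

# Merge labels that differ only by capitalisation or stray whitespace.
normalized = Counter()
for label, count in extra_counts.items():
    normalized[label.strip().lower()] += count

for label, count in normalized.most_common():
    print(f"{label:10s} {count}")
```

After normalisation there are 6 categories instead of 10, and the total still equals the 150451 observations.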
Batsman_Runs_Scored
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.22219859 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 61151 |
| Zeros (%) | 40.6% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.594310883 |
|---|---|
| Coefficient of variation (CV) | 1.3044614 |
| Kurtosis | 1.697061955 |
| Mean | 1.22219859 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.596954927 |
| Sum | 183881 |
| Variance | 2.541827193 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 61151 | 40.6% | |
| 1 | 55495 | 36.9% | |
| 4 | 17026 | 11.3% | |
| 2 | 9705 | 6.5% | |
| 6 | 6520 | 4.3% | |
| 3 | 509 | 0.3% | |
| 5 | 45 | < 0.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 61151 | 40.6% | |
| 1 | 55495 | 36.9% | |
| 2 | 9705 | 6.5% | |
| 3 | 509 | 0.3% | |
| 4 | 17026 | 11.3% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 6520 | 4.3% | |
| 5 | 45 | < 0.1% | |
| 4 | 17026 | 11.3% | |
| 3 | 509 | 0.3% | |
| 2 | 9705 | 6.5% |
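Because Batsman_Runs_Scored takes only seven distinct values, its summary statistics can be recomputed directly from the value counts. The sketch below does this for the sum, mean, share of zeros and median; the counts are copied from the report, and the helper name `weighted_median` is hypothetical.

```python
# Value counts for Batsman_Runs_Scored, copied from the report.
run_counts = {0: 61151, 1: 55495, 2: 9705, 3: 509, 4: 17026, 5: 45, 6: 6520}

n = sum(run_counts.values())
total_runs = sum(value * count for value, count in run_counts.items())
mean_runs = total_runs / n
zero_share = 100 * run_counts[0] / n


def weighted_median(counts):
    """Median of a frequency table (lower middle element for even totals)."""
    half = (sum(counts.values()) + 1) // 2
    seen = 0
    for value in sorted(counts):
        seen += counts[value]
        if seen >= half:
            return value


print(n, total_runs, round(mean_runs, 4), round(zero_share, 1),
      weighted_median(run_counts))
```

The results match the report: 150451 observations, a sum of 183881, a mean of about 1.2222, 40.6% zeros, and a median of 1.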
Out_type
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| Not Applicable | 143013 | 95.1% |
| caught | 3678 | 2.4% |
| bowled | 1382 | 0.9% |
| run out | 755 | 0.5% |
| Keeper Catch | 695 | 0.5% |
| lbw | 455 | 0.3% |
| stumped | 243 | 0.2% |
| caught and bowled | 211 | 0.1% |
| retired hurt | 9 | < 0.1% |
| hit wicket | 9 | < 0.1% |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Length
| Max length | 21 |
|---|---|
| Median length | 14 |
| Mean length | 13.645898 |
| Min length | 3 |
Match_Date
Categorical
| Distinct | 450 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 4/23/2009 | 513 | 0.3% |
| 4/29/2017 | 508 | 0.3% |
| 05-05-2012 | 506 | 0.3% |
| 4/16/2013 | 506 | 0.3% |
| 5/19/2014 | 503 | 0.3% |
| 4/29/2009 | 502 | 0.3% |
| 3/25/2010 | 502 | 0.3% |
| 3/21/2010 | 501 | 0.3% |
| 5/17/2009 | 501 | 0.3% |
| 4/16/2017 | 500 | 0.3% |
| Other values (440) | 145409 | 96.6% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.371569481 |
| Min length | 9 |
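Match_Date is stored as text in at least two layouts: slash-separated values such as 4/23/2009 and dash-separated values such as 05-05-2012. A minimal sketch of parsing both into real dates follows. The function name `parse_match_date` is hypothetical, and reading the dashed form as month-day-year is an assumption: it cannot be confirmed from 05-05-2012 alone and should be checked against the raw data before use.

```python
from datetime import date, datetime

# Assumption: slashes mean month/day/year (4/23/2009 can only be read that
# way); dashes are also read as month-day-year, which is unverified.
CANDIDATE_FORMATS = ("%m/%d/%Y", "%m-%d-%Y")


def parse_match_date(text: str) -> date:
    """Try each candidate layout in turn and return the first that parses."""
    for fmt in CANDIDATE_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt).date()
        except ValueError:
            continue
    raise ValueError(f"unrecognised Match_Date value: {text!r}")


print(parse_match_date("4/23/2009"), parse_match_date("05-05-2012"))
```

Once the column holds real dates it can be treated as a date variable rather than a categorical one, so the 450 distinct values stop being a cardinality concern.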
Striker
Real number (ℝ≥0)
| Distinct | 460 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 136.5370386 |
|---|---|
| Minimum | 1 |
| Maximum | 497 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 40 |
| median | 96 |
| Q3 | 208 |
| 95-th percentile | 376 |
| Maximum | 497 |
| Range | 496 |
| Interquartile range (IQR) | 168 |
Descriptive statistics
| Standard deviation | 120.5342396 |
|---|---|
| Coefficient of variation (CV) | 0.8827951797 |
| Kurtosis | -0.3033486636 |
| Mean | 136.5370386 |
| Median Absolute Deviation (MAD) | 75 |
| Skewness | 0.8738477794 |
| Sum | 20542134 |
| Variance | 14528.50291 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 8 | 3494 | 2.3% | |
| 40 | 3433 | 2.3% | |
| 21 | 3369 | 2.2% | |
| 57 | 3274 | 2.2% | |
| 42 | 3005 | 2.0% | |
| 46 | 2960 | 2.0% | |
| 187 | 2902 | 1.9% | |
| 20 | 2680 | 1.8% | |
| 85 | 2602 | 1.7% | |
| 162 | 2531 | 1.7% | |
| Other values (450) | 120201 | 79.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1326 | 0.9% | |
| 2 | 2181 | 1.4% | |
| 3 | 129 | 0.1% | |
| 4 | 1101 | 0.7% | |
| 5 | 84 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 497 | 12 | < 0.1% | |
| 496 | 26 | < 0.1% | |
| 495 | 11 | < 0.1% | |
| 491 | 31 | < 0.1% | |
| 490 | 2 | < 0.1% |
Non_Striker
Real number (ℝ≥0)
| Distinct | 457 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 135.6234189 |
|---|---|
| Minimum | 1 |
| Maximum | 497 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 40 |
| median | 96 |
| Q3 | 208 |
| 95-th percentile | 375 |
| Maximum | 497 |
| Range | 496 |
| Interquartile range (IQR) | 168 |
Descriptive statistics
| Standard deviation | 120.0704115 |
|---|---|
| Coefficient of variation (CV) | 0.8853221104 |
| Kurtosis | -0.2513749151 |
| Mean | 135.6234189 |
| Median Absolute Deviation (MAD) | 73 |
| Skewness | 0.8946355224 |
| Sum | 20404679 |
| Variance | 14416.90371 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 40 | 3635 | 2.4% | |
| 21 | 3483 | 2.3% | |
| 8 | 3350 | 2.2% | |
| 57 | 3306 | 2.2% | |
| 42 | 3248 | 2.2% | |
| 46 | 2848 | 1.9% | |
| 85 | 2831 | 1.9% | |
| 187 | 2672 | 1.8% | |
| 162 | 2458 | 1.6% | |
| 20 | 2432 | 1.6% | |
| Other values (447) | 120188 | 79.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1383 | 0.9% | |
| 2 | 2263 | 1.5% | |
| 3 | 142 | 0.1% | |
| 4 | 1129 | 0.8% | |
| 5 | 69 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 497 | 14 | < 0.1% | |
| 496 | 35 | < 0.1% | |
| 495 | 10 | < 0.1% | |
| 491 | 37 | < 0.1% | |
| 490 | 1 | < 0.1% |
Bowler
Real number (ℝ≥0)
| Distinct | 355 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 194.0870981 |
|---|---|
| Minimum | 1 |
| Maximum | 497 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 77 |
| median | 174 |
| Q3 | 310 |
| 95-th percentile | 416 |
| Maximum | 497 |
| Range | 496 |
| Interquartile range (IQR) | 233 |
Descriptive statistics
| Standard deviation | 132.9989497 |
|---|---|
| Coefficient of variation (CV) | 0.6852539453 |
| Kurtosis | -1.066164182 |
| Mean | 194.0870981 |
| Median Absolute Deviation (MAD) | 107 |
| Skewness | 0.3924450847 |
| Sum | 29200598 |
| Variance | 17688.72063 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 50 | 2989 | 2.0% | |
| 136 | 2703 | 1.8% | |
| 194 | 2694 | 1.8% | |
| 14 | 2636 | 1.8% | |
| 67 | 2594 | 1.7% | |
| 201 | 2359 | 1.6% | |
| 15 | 2276 | 1.5% | |
| 81 | 2161 | 1.4% | |
| 94 | 2159 | 1.4% | |
| 29 | 2113 | 1.4% | |
| Other values (345) | 125767 | 83.6% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 280 | 0.2% | |
| 4 | 323 | 0.2% | |
| 5 | 63 | < 0.1% | |
| 8 | 264 | 0.2% | |
| 9 | 1799 | 1.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 497 | 182 | 0.1% | |
| 495 | 111 | 0.1% | |
| 494 | 21 | < 0.1% | |
| 493 | 82 | 0.1% | |
| 492 | 24 | < 0.1% |
Player_Out
Real number (ℝ≥0)
| Distinct | 435 |
|---|---|
| Distinct (%) | 5.8% |
| Missing | 143013 |
| Missing (%) | 95.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 148.6332347 |
|---|---|
| Minimum | 1 |
| Maximum | 497 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 41 |
| median | 107 |
| Q3 | 236 |
| 95-th percentile | 391 |
| Maximum | 497 |
| Range | 496 |
| Interquartile range (IQR) | 195 |
Descriptive statistics
| Standard deviation | 124.7608827 |
|---|---|
| Coefficient of variation (CV) | 0.839387523 |
| Kurtosis | -0.6106458837 |
| Mean | 148.6332347 |
| Median Absolute Deviation (MAD) | 80 |
| Skewness | 0.7270238885 |
| Sum | 1105534 |
| Variance | 15565.27786 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 21 | 134 | 0.1% | |
| 40 | 131 | 0.1% | |
| 57 | 129 | 0.1% | |
| 46 | 128 | 0.1% | |
| 8 | 118 | 0.1% | |
| 88 | 117 | 0.1% | |
| 42 | 109 | 0.1% | |
| 17 | 107 | 0.1% | |
| 27 | 101 | 0.1% | |
| 187 | 100 | 0.1% | |
| Other values (425) | 6264 | 4.2% | |
| (Missing) | 143013 | 95.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 53 | < 0.1% | |
| 2 | 98 | 0.1% | |
| 3 | 9 | < 0.1% | |
| 4 | 49 | < 0.1% | |
| 5 | 7 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 497 | 1 | < 0.1% | |
| 496 | 3 | < 0.1% | |
| 495 | 3 | < 0.1% | |
| 491 | 2 | < 0.1% | |
| 489 | 2 | < 0.1% |
Fielders
Real number (ℝ≥0)
| Distinct | 400 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 145100 |
| Missing (%) | 96.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 155.3918894 |
|---|---|
| Minimum | 1 |
| Maximum | 497 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 47 |
| median | 111 |
| Q3 | 237.5 |
| 95-th percentile | 386.5 |
| Maximum | 497 |
| Range | 496 |
| Interquartile range (IQR) | 190.5 |
Descriptive statistics
| Standard deviation | 125.126355 |
|---|---|
| Coefficient of variation (CV) | 0.8052309261 |
| Kurtosis | -0.6340701806 |
| Mean | 155.3918894 |
| Median Absolute Deviation (MAD) | 81 |
| Skewness | 0.6972704448 |
| Sum | 831502 |
| Variance | 15656.60471 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 88 | 127 | 0.1% | |
| 20 | 126 | 0.1% | |
| 46 | 115 | 0.1% | |
| 110 | 103 | 0.1% | |
| 21 | 96 | 0.1% | |
| 17 | 84 | 0.1% | |
| 183 | 82 | 0.1% | |
| 57 | 79 | 0.1% | |
| 53 | 75 | < 0.1% | |
| 8 | 74 | < 0.1% | |
| Other values (390) | 4390 | 2.9% | |
| (Missing) | 145100 | 96.4% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 23 | < 0.1% | |
| 2 | 50 | < 0.1% | |
| 3 | 4 | < 0.1% | |
| 4 | 29 | < 0.1% | |
| 5 | 2 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 497 | 5 | < 0.1% | |
| 496 | 2 | < 0.1% | |
| 495 | 1 | < 0.1% | |
| 491 | 3 | < 0.1% | |
| 490 | 1 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a perfectly linear relationship the steepness of the line (its angle to the x-axis) does not affect the magnitude of r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
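The calculation described above can be sketched in plain Python. This is an illustrative implementation, not the code the report itself runs; the function name `pearson_r` is hypothetical. The 1/n factors of the covariance and the two standard deviations cancel, so they are omitted.

```python
import math


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their std devs."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally long sequences of length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


xs = [1, 2, 3, 4, 5]
shallow = [2 * x + 1 for x in xs]   # gentle positive slope
steep = [10 * x - 3 for x in xs]    # much steeper positive slope
falling = [-x for x in xs]          # negative slope
print(pearson_r(xs, shallow), pearson_r(xs, steep), pearson_r(xs, falling))
```

Both rising lines give the same r despite their different slopes, and the falling line gives the opposite sign, which illustrates the location and scale invariance mentioned above.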
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
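As a sketch of that definition, the code below converts each variable to ranks (ties share the average of the positions they occupy, which is an assumed but common convention) and then applies the covariance-over-standard-deviations formula to the ranks. The function names are hypothetical.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman_rho(xs, ys):
    """Pearson correlation computed on the rank variables of xs and ys."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)


xs = [1, 2, 3, 4, 5]
cubed = [x ** 3 for x in xs]  # nonlinear but strictly increasing
print(spearman_rho(xs, cubed))
```

For the cubic relationship the ranks of both variables are identical, so ρ reaches its maximum even though the relationship is not linear.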
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
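The pair-counting formula in the text corresponds to the variant usually called tau-a. A minimal sketch follows; the function name `kendall_tau_a` is hypothetical. Library implementations such as `scipy.stats.kendalltau` default to the tau-b variant, which adjusts the denominator for ties, so results can differ from this sketch when the data contain tied values.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally long sequences of length >= 2")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move the same way
        elif direction < 0:
            discordant += 1   # the variables move in opposite ways
        # direction == 0 means a tie: counted in neither group
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


xs = [1, 2, 3, 4]
ys = [1, 3, 2, 4]  # one swapped pair out of six
print(kendall_tau_a(xs, ys))
```

With four observations there are six pairs; five are concordant and one is discordant, giving τ = (5 - 1) / 6.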
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | MatcH_id | Over_id | Ball_id | Innings_No | Team_Batting | Team_Bowling | Striker_Batting_Position | Extra_Type | Batsman_Runs_Scored | Out_type | Match_Date | Striker | Non_Striker | Bowler | Player_Out | Fielders |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 598028 | 15 | 6 | 1 | 5 | 2 | 6.0 | No Extras | 4 | Not Applicable | 4/20/2013 | 277 | 104 | 83 | NaN | NaN |
| 1 | 598028 | 14 | 1 | 1 | 5 | 2 | 5.0 | No Extras | 1 | Not Applicable | 4/20/2013 | 104 | 6 | 346 | NaN | NaN |
| 2 | 598028 | 14 | 2 | 1 | 5 | 2 | 3.0 | No Extras | 1 | Not Applicable | 4/20/2013 | 6 | 104 | 346 | NaN | NaN |
| 3 | 598028 | 14 | 3 | 1 | 5 | 2 | 5.0 | No Extras | 1 | Not Applicable | 4/20/2013 | 104 | 6 | 346 | NaN | NaN |
| 4 | 598028 | 14 | 4 | 1 | 5 | 2 | 3.0 | No Extras | 0 | Not Applicable | 4/20/2013 | 6 | 104 | 346 | NaN | NaN |
| 5 | 598028 | 14 | 5 | 1 | 5 | 2 | 3.0 | No Extras | 4 | Not Applicable | 4/20/2013 | 6 | 104 | 346 | NaN | NaN |
| 6 | 598028 | 14 | 6 | 1 | 5 | 2 | 3.0 | No Extras | 2 | Not Applicable | 4/20/2013 | 6 | 104 | 346 | NaN | NaN |
| 7 | 598028 | 13 | 1 | 1 | 5 | 2 | 5.0 | No Extras | 1 | Not Applicable | 4/20/2013 | 104 | 6 | 83 | NaN | NaN |
| 8 | 598028 | 13 | 2 | 1 | 5 | 2 | 3.0 | No Extras | 4 | Not Applicable | 4/20/2013 | 6 | 104 | 83 | NaN | NaN |
| 9 | 598028 | 13 | 3 | 1 | 5 | 2 | 3.0 | No Extras | 1 | Not Applicable | 4/20/2013 | 6 | 104 | 83 | NaN | NaN |
Last rows
| | MatcH_id | Over_id | Ball_id | Innings_No | Team_Batting | Team_Bowling | Striker_Batting_Position | Extra_Type | Batsman_Runs_Scored | Out_type | Match_Date | Striker | Non_Striker | Bowler | Player_Out | Fielders |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 150441 | 598028 | 16 | 2 | 1 | 5 | 2 | 5.0 | No Extras | 0 | Keeper Catch | 4/20/2013 | 104 | 277 | 81 | 104.0 | 239.0 |
| 150442 | 598028 | 16 | 3 | 1 | 5 | 2 | 7.0 | No Extras | 0 | Not Applicable | 4/20/2013 | 310 | 277 | 81 | NaN | NaN |
| 150443 | 598028 | 16 | 4 | 1 | 5 | 2 | 7.0 | No Extras | 2 | Not Applicable | 4/20/2013 | 310 | 277 | 81 | NaN | NaN |
| 150444 | 598028 | 16 | 5 | 1 | 5 | 2 | 7.0 | No Extras | 0 | Not Applicable | 4/20/2013 | 310 | 277 | 81 | NaN | NaN |
| 150445 | 598028 | 16 | 6 | 1 | 5 | 2 | 7.0 | No Extras | 1 | Not Applicable | 4/20/2013 | 310 | 277 | 81 | NaN | NaN |
| 150446 | 598028 | 15 | 1 | 1 | 5 | 2 | 5.0 | No Extras | 1 | Not Applicable | 4/20/2013 | 104 | 6 | 83 | NaN | NaN |
| 150447 | 598028 | 15 | 2 | 1 | 5 | 2 | 3.0 | No Extras | 2 | Not Applicable | 4/20/2013 | 6 | 104 | 83 | NaN | NaN |
| 150448 | 598028 | 15 | 3 | 1 | 5 | 2 | 3.0 | No Extras | 4 | Not Applicable | 4/20/2013 | 6 | 104 | 83 | NaN | NaN |
| 150449 | 598028 | 15 | 4 | 1 | 5 | 2 | 3.0 | No Extras | 0 | caught | 4/20/2013 | 6 | 104 | 83 | 6.0 | 349.0 |
| 150450 | 598028 | 15 | 5 | 1 | 5 | 2 | 6.0 | No Extras | 0 | Not Applicable | 4/20/2013 | 277 | 104 | 83 | NaN | NaN |